Modular Graph Transformer Networks for Multi-Label Image Classification
نویسندگان
چکیده
With the recent advances in graph neural networks, there is a rising number of studies on graph-based multi-label classification with consideration object dependencies within visual data. Nevertheless, representations can become indistinguishable due to complex nature label relationships. We propose image framework based transformer networks fully exploit inter-label interactions. The paper presents modular learning scheme enhance performance by segregating computational into multiple sub-graphs modularity. proposed approach, named Modular Graph Transformer Networks (MGTN), capable employing backbones for better information propagation over different guided transformers and convolutions. validate our MS-COCO Fashion550K datasets demonstrate improvements classification. source code available at https://github.com/ReML-AI/MGTN.
منابع مشابه
Regional Gating Neural Networks for Multi-label Image Classification
This paper proposes a novel deep learning framework for multi-label image classification, namely regional gating neural networks (RGNN). The motivation is two folds. First, global image features (including CNN based features) ignore the underlying context information among different objects in an image. Consequently, people attempt to use information from objectness regions. However, current ob...
متن کاملMatrix Completion for Multi-label Image Classification
Recently, image categorization has been an active research topic due to the urgent need to retrieve and browse digital images via semantic keywords. This paper formulates image categorization as a multi-label classification problem using recent advances in matrix completion. Under this setting, classification of testing data is posed as a problem of completing unknown label entries on a data ma...
متن کاملProximity-based Graph Embeddings for Multi-label Classification
In many real applications of text mining, information retrieval and natural language processing, large-scale features are frequently used, which often make the employed machine learning algorithms intractable, leading to the well-known problem “curse of dimensionality”. Aiming at not only removing the redundant information from the original features but also improving their discriminating abili...
متن کاملMulti-label Image Classification with A Probabilistic Label Enhancement Model
In this paper, we present a novel probabilistic label enhancement model to tackle multi-label image classification problem. Recognizing multiple objects in images is a challenging problem due to label sparsity, appearance variations of the objects and occlusions. We propose to tackle these difficulties from a novel perspective by constructing auxiliary labels in the output space. Our idea is to...
متن کاملGraph Convolutional Networks for Classification with a Structured Label Space
It is a usual practice to ignore any structural information underlying classes in multi-class classification. In this paper, we propose a graph convolutional network (GCN) augmented neural network classifier to exploit a known, underlying graph structure of labels. The proposed approach resembles an (approximate) inference procedure in, for instance, a conditional random field (CRF), however wi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i10.17098